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Abstract 



The splitting of a problem into aubproblews often 
involves the sane variable appearing in more than one of the 
subproblems. Thlo *ekes these oubprobletis dependent upon one 
another eince a solution to one may not qualify as a solution to 
another. A tuo stage method of splitting is described uhieh 
first obtains solutions by relaxing the dependency requirement 
and then atteapts to reconcile solutions to different 
eubproblems. The net hod has been realized as part of an 
automatic theoreoi prover programmed in LISP which takes advantage 
of the procedural pousr that LISP provides. The program has had 
success uith cryptar I thmetlc problems, problems fro* the blocks 
uorld. and has been used as a subroutine in a plane geometry 
theorem prover. 
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1* Introduction 

The splitting of a problem into subproblems often 
involves the same variable appearing in nore than one of the 
subproblems. This makes these subproblems dependent upon one 
another since a solution to one nay not qualify as a solution to 
another. This paper describes a two stage method of splitting 
which (1) first solves the subproblems as if thsg Mere 
indepsndent (I.e. completely relaxes the dependency requirement) 
and (2) then attempts to reconcile solutions to different 
subproblems. The advantages of this approach lie in the economy 
derived fro* making only a single pass at finding solutions and 
In the ability to use global knowledge about the solutions 
obtained In stage I to assist the search for a reconciliation of 

these solutions In stage 2* 

A poeeible disadvantage of this method is that it could 
be difficult to decide when to stop searching for unconstrained 
solutions to a subproblem in stage I, particularly uhen the set 
of euch solutions might be quite large- In order to make thle 
method work for an interesting class of problems, heuristics uera 
employed to prevent the search taking too long In cases where the 
number of solutions is infinite or very large- In particular, 
every case to a split need not be handled during the relaxation 
stage; some may be postponed to the reconciliation stage uhere 
constraints arising from the other cases can be brought to bear. 
The method has bssn realized as part of an automatic theorem 
prover programmed In LISP. The program has had success with 
cryptari thmet ic problems (91. problems from the blocks world 
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1131, and has bctn uiod as a subroutine in a piano geometry 
theorem prover (81 . 

The reconciliation stage beare a resemblance to 
constraint method* such ae used In 131. The reconciliation stage 
also can be thought of ae an attenpt to "debug" solutions uhich 
were obtained by first ignoring interactions betueen aubproblems 
and therefore is related to 1121* The overall approach to 
splitting is related to methods used In 12) where the 
investigation of a subproblen resulted in the assignment of a 
type to a variable appearing In that subproblen. Although the 
present procedure is oriented more to problems uhich aJ lou for 
explicit enumeration of solutions, it would not be adverse to the 
addition of a type theory mechanism. 

The logical formalism underlying the program Is presented 

in section 2 and some considerations of a semantic nature are 

discussed in section 3* The program also can make use of 

procedural knowledge as in ID, (21, 131, 141, 161, til) and this 

facility is described in section 4. The relaxation and 

reconciliation stages are examined in sections S and G 

respectively. Section 7 uill provide soms illustrative examples 

* 
of the program's psrformance. 

2. The Logical Deduction Rule* 

The reader is referred to (IB) for background material on 
predicate calculus theorem provera. It uill be assumed that the 
negation of the goal is to be regarded as a true formula, all 
formulas are In prenex form, and all sKistentlal variables have 
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been replaced by tkolom functions 110). Ue adopt the convention 
that E and E* always represent expressions which have been made 
Identical by the unification algorithm 110). The theorem prover 
has 35 ) te logical basis the following 11 rules of Inference: 
Rl, A problem is solved when it has been established that both 

the literals A' and -A are true. 
R2. Replace formula »A by A. 
R3. Replace formula A/\B by A, 8. 
R4. Replace formula ^lAvfl) by -A, *6 

(i.e. If one wishes to prove Avfl. then either 

prove A or prove B) > 

■ 
RS. Replace formula <v(A-*B) by A, +B 

(I.e. If one wishes to prove A-B t then 

assume A and prove B). 

■ 

RE. (Modus ponens) If It has been established that A* and 

A-*B are true where A is a literal, then add B* 
to the set of true formulas. 
R7. (Modus ponens) If It has been established that ~B' and 

'\ •?- are true where B is an atomic formula, then add 
*A* to the set of true formulas. 
R8. tflodus ponens) If It has been established that B" and 

A-*8 are true where B is an atomic formula, then 
add *A' to the set of true formulas. 
R3. (Reasoning by cases) If AvB is true, then split 

Avfi Into case A and case 8. If A and B arm 
independent (I.e. A and B have no variables in common), 
then break the problem Into two subprobleme by first 
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assuming A and then assuming B. The treatment then would 
be the same as in 171 where an intensive effort would be made 
to eolve a case before resorting to an additional case analysis 
and where a mechanism for curbing an explosion of cases ie 
employed. However, if A and B have a variable in common, then 
the case analysis is conducted as described in sections 5 and G. 
RIB. Replace formula ~(AaB) by -4wB. 
RX1. (Equality relation) If r - t and Air) are true where 

A(r) is a literal in which the term r appears, then each 
occurrence of r in A(r) ie replaced by the tern t. This 
treatment of equality is crude compared with other automatic 
theorem provers 151. (71 and most likely would inhibit 
thfl ability gf tht program to provt thsorsms In subject* like 
group theory or number theory. However, there ie nothing 
about the present program which would prevent the Introduction 
of the more sophisticated treatment of equality which this 
author employed in (71. 

The above rules are the sate as in (71 except ae already noted 
ulth respect to R9 and Rll- The control structure governing the 
execution of these rulee also is the same as in (71* A comparison of 
this system with resolution theorem proving 118} already has been 
presented In 17) and will not be repeated here* The main objective of 
the present paper is to present a better approach to splitting than was 

employed in 17) * 

+ 

3. So«s_Semantic. Considerations 

The present use of seaantics bears a resemblance to the 



7 



PACE E 

"model" strategies uhich have been used by resolution theorem 
provers 1101. It is based upon the duality that is associated 
uith different etatenents in the data base* For the statement *A 
can be regarded as asserting that *A is true (and hence A le 
false) or It can be regarded as asserting that ue ulsh to prove 
A. From the point of vieu of the format logic. It Bakes no 
difference uhich Interpretation you choose. But from the point 
of vieu of the underlying eemantice it makes all the difference 
in the uorld whether you believe A to be false or whether you 
believe that you actually can prove A. The commonly held belief 
that predicate calculus theorem provers reason only by 
"contradiction" Is rooted In the notion that the theorem prover 
either dose not eaploy semantics or else employs semantics uhich 
do not distinguish the denial of the etatement to be proved fro* 
other statements in the data base. 

The program divides statements Into "facts" and "goals.* 
Initially, all axioms are regarded as facts and the negation of 
the statement to be proved is regarded as a goal. Two statements 
are not allowed to Interact uith each other (as paired inputs to 
the same inference rule) unless one of the statemente Im either 
the initial goal or a descendant of the initial goal. The 
initial goal therefore comprises a set of support (141- Of 
course, there is some flexibility as to uhat actually constitutes / 

the statement to be proved* For example, the set of statements 
A, B> C P D, *€ coufd represent the Initial data base as uell as 
the eet of statements A, B t <v((CaD)-E) so far as the formal logic 
is concerned. Houever, the latter representation uould be 
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preferred if semantic consideration* Indicated that the 
hypotheses C and D were specifically related to the goal of 
proving E. In the latter representation. C and D would De 
generated as descendants of the initial goal through the 
application of rule RS followed by R3 whereas they would not de 
regarded at descendants of the initial goal in the former 
representation* 

Ae a new statement is generated by a deduction rule, the 
program makes a determination as to whether the statement is a 

fact or a goal* In rule R9, a case is regarded as a goal if and 
only If the case has a negation sign at Its front and the formula 
being split it a goal. The output of rule R19 is regarded as a 
goal if and only if the input to that rule is a goal* If *(A-tB) 
were a goal, rule RS would result m the output A treated ae a 

fact and the output -B as a goal. For all the other deduction 
rules, an output is regarded as a goal If and only If one of the 

inputs ie a goal and the output has a negation sign at its front. 

Thus, the output B' of rule RE would be a fact if A' were a fact 

since A-B as the second input to RE could not possibly be a goal. 
This separation into facts and goals can help to control 

interactions In the data base* Far suppose *A and *8 are goals 
and neither is an ancestor of the other; the program then 
normal ly would allow *A and *B to interact with each other but 
would not allou descendants of *A to Interact with descendants of 

-a- 

The symbol * has an important semantic role which has 
been neglected by resolution theorem provers. Aside from 
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assisting in the separation of statements into facts and goals as 
uas just described. Its presence serves as a cue for the program. 
Namely, the program interprets the statement A-»B in a manner 
which It different from the way it reacts to the logically 
equivalent statement -AvB, The case analysis of ~AvB involves a 
commitment by the program to prove A so that 8 can be concluded. 
Houever, rule RG applied to A-fl does not involve any commitment 
to prove A| It Just says that if A alreadu hai been regarded as 
true, then Me can conclude B. In particular, the program ui I I 

not conclude C-«B from the two statements C-*A and A-»B, 

* 

4. The _Usg_of Procedures 

Uhilfl the program ti not intendsd as a nsw programming 
language* it does have features uhich make it a meaningful 
■Ktonsion of LISP. The theorem prover is a LISP Function named 
DERIVE-: uhich operates upon three arguments (1) the initial list 
of formulas stripped of all quantifiers. (2) a list of variables 
unose values are desired as the answer to the problem, and 13) 
NIL or T depending upon whether or not the request is for just 
one ansuer or for all possible answers. If it is desired that a 
particular formula A be an Initial goal, then the formula would 
be coded as (GOAL* A). 

Uhenever a literal P(T1 Tn) or a literal 

~P(TI,.*«».Tn) has been established as true and is ready for 
processing by the deduction rules* Its predicate letter P is 
examined to see uhether it is atso a LISP function. If thm 
ansuer it yes. then P1T1, Tn) is evaluated and, depending 
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upon the output from this evaluation, there uould oe four 
possible responses! (1) if the ouput It T, the case under 
consideration is declared solved, (2) if the output is NIL, the 
literal is rejected and therefore Mould not be applied to the 
deduction rules, 13) If the output is a formula other than 

P1T1 Tn), then 1 1 replaces P(T1 Tn) and the resulting 

formula (i»ftt either Q or 4 depending upon whether the original 

literal was P<T1 Tn) or *P(T1 Tn)) uouJd await ite turn 

for processing by the deduction rules, and (4) otherwise* the 
deduction continues as If the predicate letter P were not a LISP 
function. It should be noted uith respect to the third 
possibility that the output Q does not have to be another atomic 
formula like P(Ti t , t(t ,Tn); also, Q is regarded as a replacement 
for P»T1 Tn) and not an a descendant of PIT1, Tn) , 

Since the theorea prover DERIVEX Is a LIS* function, it 
can be used as a subroutine by other LISP functions. In 
particular, a LISP function that uses it tight also be a 
predicate letter so that recursive applications of DERIVEX are 
possible. The theorem prover ui II examine function letters and 
Mill execute them If they are also LISP functions* The 
difference between the procedural treatment of function letters 
and predicate letters la that the former are executed immediately 
whereas the latter are executed only when appearing in a detached 
literal which Is ready for processing by the deduction rules. 

The formula A**B would be represented in LISP as 
{IMPLIES A BK However, In that for* it never would be used as 
an input to any of the the modus ponen* rules (i.e. RG, R7 or 
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R£). If ong wanted to a I low for a possible appl i cat ion to rule 
R6. then it Mould be coded as HHPUES (ROUTINE* A R) B) where R 
is some LISP function whose purpose is to screen out undesirable 
literals A" from application to rule RG. If one wanted to allow 
for a possible application of A-*B to all three nodus ponens 
rules, then it would have to be coded in the forn (IHPLIES 
[RDUTINEX A Rl) IROUTINEX B R21) where the LISP functions Rl and 
R2 would likewise help filter out unprofitable applications of 
these rules. The use of ROUTINEX is at the discretion of the 
human programmer. The filter would be disengaged by a choice of 
Rl and R2 which always evaluated as T, An example of its 
nontrivial use is given in section S together with other 

facilities for representing kponlecfyt In the fgrm of procedures* 

* 

5. Thr- _Rc ?ax f it i on _5 Thrift 

Since the relaxation stage involves searching for "all" 
solutions to a case, one sight object on the grounds that such a 
search could take too long (and in fact might never be 
completed). However, the search does not have to be for all 
possible solutions but rather for all solutions that can be 
obtained when certain heuristics are employed. Moreover, this 
search need not be conducted for every case generated by the 
split, as ul 1 1 be shown below* The degree of success should 
depend upon the problem domain and the individual heuristics. 
The program in 171 searched for "all - solutions to the first cose 
of a split and yet performed quite creditably In domains such as 
number theory and group theory. 
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Suppose the program wish©* to split the formula A(x)vfHx) 
into cases. Although all existential variables were removed at 
the outset by the skoleairatlon process, the variable x. uhich 
appears in both cases* uill be called (as in (71 i an 
"existential' variable since the object it to find some value of 
k which Mill permit a solution to each case. First. An attempt 
is wade to find values of x uhich Mill solve the problem under 
the assunption that Alxl le true. After all the successful 
values of x are recorded, the sane procedure is applied to BU) 
with x still treated as a variable and not constrained only to 
those values that yielded solutions to ths previous caoe A<xi. 
Suppose that the formula Bly)*(C(ylvD(y)) is part of the data 
base for case Blx). Then, the application of B(yl**(C(y)v01yH 
Mith BU) to rule R6 Mould natch the variables x and y and 
generate the formula C(x)vGMx). The formula C(x)vOU) uould 
become a candidate for splitting uithin the scope of case B (x) 
and the program uould not hesitate to attempt such a split 
(unless a solution of Blx) could be found uhich did not require 
the assignment of a specific value to some existential variable 
appearing in B(x)l. Houever* suppose that instead of 
B<y>-*(C(y)vD(yH t the formula Bty)-(C(y.z)vD(y. z» > had appeared 
in the data oase unere z is a variable that does not have an 
existential interpretation. The application of this formula with 
Blx> to rule RS would generate the formula CU,z) vD(«,zl but in 
order to split this latter formula an existential interpretation 
uould have to be created for the variable z* Experience **ith 171 
and the present program has indicated that it is usually not a 
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good idea to allow a split if (1) ons or nore existential 
variables appeared already in the hypothesis of the the case 
under consideration and (2) the new split would create one or 
more additional existential variables. So Me did soaethinn; to 
prevent this additional splM. One method la to use the filter 
mechanlsw of section 4 by coding Bly)-*(C(y,zl vOly.z) \ as (IMPLIES 
IROUTINEX B(y) R) (OR Cly.z) 0(y,x))» where routine R would 
reject any formula that possessed an existential variable. 
Alternatively, there Is a global variable which is zero or one 
depending upon whether or not a case analysis is being conducted 
that already has generated existential variables! therefore, the 
routine R might reject a fornula If this global variable had the 

value zero, Tht program also uould reject such a split 
automatical ly if there were nested above it a previous split 
which violated the above principle regarding the introduction of 
existential variables (i.e. even if the filter alloued the 
program to attempt the split of C(x,z)vO(x»z) in order to solve 
one of the cases of A(xlvBU), it uould never attempt a split of 
say E(w)vFlw) Involving another new existential variable u if 
this latter formula were generated from either case CIx.xl or 
case DU,ilK 

A LISP function called RECWSIVEX enables the human user to 
inform the program that a particular case should be postponed 
until after the rest of the split has been solved; when this case 
finally ie attenpted, It is with values that ars known to satisfy 
the other cases of the split. Thus* a split of the formula 
A(x)vRECURS]VEXtBiw})vCU) would result in formulas B(K1), 
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8(K2) BKn) being added to the data case wnere Kl, 

- 

K2..>***Kn represent the different solutions for x that uere 
obtained from spl I tt ing Alx)vC(x) . These formulas 

B<K1 > B(Kn) then would De available for use In an attempt to 

split sone other formula Oly)vE(yK 

The motivation behind the UM of RECURSIVEX it that there 
might be semantic reasons for believing that a particular formula 
B(k) wight be either (11 too "expansive* if one of its 
existential variables * is aMoued to be unconstrained (i.e. the 
hypothesis Blx) night generate too nany formulas if x is 
unrestricted) or 12) B(*> night lend Itself to recursive use in a 
manner that uould oafce the above heuristic governing the 
introduction of neu existential variables appear too severe a 
restriction- If either of these conditions prevailed, it uould 
be more appropriate to code BIx) as RECURS! VEX (BU) ) . 

Another LISP function called TESTK postpones action on a 
case until the reconciliation stage (as opposed to RECURSIVEX 
which postpones action until after the reconciliation stage ie 
completed). By coding BIx) as TESTX(Blx)) the human user informs 
the progran that BN) is to be executed as a LiSP function 
whenever a value is assigned to any existential variable x 
appearing in B(x)j the assignment of such a value c to x would be 
rejected if the execution of Bid returned a value of NIL, TESTX 
aeans tne case is to be used to test rather than generate neu 
assignments to existential variables. For example, x > 3 
nor*al ly uould be coded as TESTXIw > 9) since ue do not wish to 
generate all the positive integers as values for x but only to 
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test uhether a particular value it a positive integer, 

6* Jhe Reconci I iation Slant 

Throughout this section the word "variable" ui I I refer to 
existential variable. The input to the reconci t iat ion stage 
consists of a list of alternative solutions tor each case* Each 
alternative solution to a case is represented at a list of 
attribute-value pairs uhere each attribute is either a variable, 
the symbol RECURS1VEX1 or the synbol TESTX1. If the attribute is 
a variable, then the value to uhich it is paired is its 
assignment for thie particular solution. If the attribute is 
either RECURSIVEX1 or TESTX1, then It Is paired uith the 
hypothesis of a case still to be solved as it corresponds to the 
application of RECURSIVEX or TE5TX respectively as described in 
section 5. 

For each variable x, the program first constructs a 

constraint set of values consistent wi th the results of the 

i 

relaxation stage. This constraint set is defined as the 
intersection of U(x,c) taken over all cases c uhere Utx.cl is the 
set of possible valuts for x as determined from the various 
solutions to case c. Thus, if a solution to case c did not 
impose any restriction on the variable x, then U(x.c) uould be 
the universal set consisting of all elements. Also, U(x,c) would 
be taken as the universal set if a solution of x for case c 
depended upon another existential variable y (e.g. x - fly) uhere 
t Is a skolem function). Otherwise, U(x,c> is just the union of 
all valuts of x obtained from all the solutions to case c. By 
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computing the constraint tot for each variable, the program it in 
d position to (II abandon the reconciliation search if the 
constraint set for to*e variable Is empty and (2) reject a neuly 
computed value of a variable If this value it not consistent with 
at least one value fro* the constraint set of that variable* 

Newt, the progran makes a determination as to the order 
In which cases are to be reconciled. First priority It given to 
cases with the feuest nunber of alternative solutions. In the 
event of a tie, the cats is sslected which involves the feuest 
number of "unbound** variables uhere a variable is said to be 
"bound" if its constraint set consists of only a single element. 

Once a case is selected* its first alternative solution 
(consisting of a list of attribute-value pairs) is examined. 
Each attribute-value pair it processed in turn. Any variable 
appearing In the list structure formed by the attribute-value 
pair is replaced by its assigned value if such an assignment 
already had been nade from ions previous case* Let (A B) be the 
resulting attribute-value pair after these replacements (if any) 
tyQvB been made. If A is the syebol TESTX1, then the expression B 
would be evaluated and the current solution rejected should th* 
evaluation return NIL. If A la neither the symbol TESTX1 nor 
RECURS1VEX1 and either 11) A li a variable which appears as a 
subeiement of B t (2) B is a variable which appears as a 
tubelement of A. or (3) neither A nor B are variables, then the 
theorem prover would attempt to prove A - B and would reject 
the current solution to the case If it failed in the attempt. If 
It succeeds in this attempt. It would return any values it might 
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have found for the variables in question and then uould continue 
uith tho next attribute-value pair of the current solution. 

Suppose A Mere a variable which was not a subeleraent of 
B. If B * Bix) depended upon exactly one variable * and the 
constraint set of * were finite, then each value k from the 
constraint set of x uould be tested to see whether 3<k) belonged 
to the constraint set of A- If no value BOO so belonged, then 
the current solution uould be rejected since it uould be 
impossible for A to equal B. If exactly one Blkl belonged to the 
constraint set of A, then this (x k> uould be treated as a neu 
ottrioute-value pair (i.e. this k uould oe assigned as the value 
of variable "K The program then uould proceed as i f 8 did not 
depend upon just one variable- In particular, all previously 
processed attribute-value pairs uould be examined to see uhether 
they contained occurrences of the variable A. All such 
occurrences of A uould be replaced by B and any pairs so involved 
would once again become candidates for processing in the sa«e 
manner as has Just been described. 

If all the attribute-value pairs were processed 
successful ly, then another case uould be generated for 
reconciliation, A solution to the split uould be obtained if no 
more cases remained to be reconciled* If (1) this split had only 
a single solution as its objective and (2) the solution that uas 
just found did not necessitate the execution of another formula 
by virtue of a postponement through the use of RECUR51VEX. then 
an exit uould be Bade from the reconciliation stage; otheruise, 
the program would record this solution (even if it includes a 



PAGE 17 



formula uhose execution has been deferred) and then uould 
backtrack In a search for additional solutions. 

Uhen a solution to a case has been abandoned, the next 
alternative solution uould be examined* If there are no wore 
alternative solutions to a case, the program uould backtrack to 
the previous case and exaetne its next alternative solution* An 
exit uould be made from the reconciliation stage uhen the first 
case has no more alternative solutions to be examined. 

7. 5o°e Illustrative E«at*pie B 

■ 

This section ulll present an in depth description of tuo 
examples (run on a POP-IB computer) which should illustrate "any 
of the features of the program. Although the program has had 
success uith other problems, such as provided by the H.l.T* 
blocks uorld (e*g« in a matter of seconds It can find the largest 
red cube or all arches on the table from situations as complex ae 
those considered in 1131), their description uould not shed much 
more light on the way the program operates* Also* since the 
present program represents an improvenent of an earlier theorem 
prover 171 uhich uas successful in group theory and number 
theory, there is reason to believe that the present program uould 
have as much success in these same domains if given a comparable 
equality rule to uork uith. 

First, s feu LISP functions that are used in these 
examples ulll be explained. niMJSK. PLUSX, and TIRESX aro 
arithmetic functions uhich correspond to subtraction, addition, 
and multiplication respectively. H1NU5X is a function of tuo 
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argument* defined by tfllNUSX ri y) equals IPLUSX H (TltlESX -1 y) ) * 
PLUSX and TlflESX are functions of an indefinite number of 
arguments whose purpose is to carry out addition and 
multiplication in the presence of variables to which numerical 
values pay not yet have been assigned. Thus, (PLU5X 3 -2 4) 
returns the value 5 but (PLUSX 3 x M would return the value 
(PLUSX x 7) 1 f no numerical value had been assigned to Xi The 
expression 

(TlflESX 3 (PLUSX ITinESX 2 x) y 4 iniNUSX z (TlflESX 2 *))>) Mhen 
evaluated would return (PLUSX (TWESX 3 y) (T1HESX 3 z) 12). CEX 
is a function of two arguments* (GEX x y) returns (CEX x y) if 
either x or y does not have a numerical value* Otherwise. (GEX x 
y) returns T or NIL Depending upon uhether or not x is as large a 
number as y, 

7.1 dissionariee and Cannibals 

This task .31 has three missionaries and three cannibals 
who wish to cross a river from the left to the right side. Their 
only means of conveyance is a boat uhich has a capacity of tuo 
people. Any of the missionaries or cannibals is capable of 
operating the boat either alone or with someone else. The 
problem is to determine a uay by uhich all the missionaries and 
cannibals can be transported to the right side without ever 
allouing the cannibals to outnumber the missionaries on any one 
side since otherwise the missionaries on that side would bs 
eaten* 

Let (LEFT ex ny) represent the assertion that it is 
possible to transport all cannibals and Missionaries to the right 
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side given thai the boat |i on the laf t side together ul th ex 
cannibals and ey Missionaries. Let {RIGHT ex ey> represent the 
assertion that tt "9 possible to transport all ■issionarles and 
cannibals to the right aide given that the Doat is on the right 

aide together uith ex cannibals and *y Missionaries. The initial 
data base for this problem consists of the following three 

formulas uhere x and y should be interpreted as referring to the 
number of cannibals and Missionaries respectively being sent 

across in the boat: 

1. (GOALX (NOT (LEFT 3 3D) uhich it the initial goal of proving 

(LEFT 3 3). 

2. (IMPLIES (AND (TESTX (GEX ex x) ) 

(TESTX IGEX ng y)> 

(OR (EQUALS x 0) (EQUALS x 1) (EQUALS * 2)1 

(OR (EQUALS y B) (EQUALS y 1) (EQUALS y 2)> 

(OR (EQUALS x (niNUSX 1 y)) (EQUALS « (MINUSX 2 y) )) 

(OR (ANO (EQUALS ex xl (EQUALS ng y) ) 

(AND (OR (EQUALS « (PLUSX c* (MINU5X g my))) 
(EQUALS ny y) 
(EQUALS y miNUSX -y 3)1) 
(RECURSIVEX (RIGHT Ml HOB* 3 (MINUSX ex n) ) 

(MINUSX 3 (MINUSX ay y) > 

))))) 
(ROUTINEX (LEFT ex my) TRUEX)) 

3. (IMPLIES (AND (TESTX (GEX ex x) 1 

(TESTX (GEX >y y>> 

(OR (EQUALS x 8) (EQUALS x 1) (EQUALS x 2)) 

(OR (EQUALS y 0) (EQUALS y 1) (EQUALS y 2)) 

(OR (EQUALS x (MINUSX 1 y) ) (EQUALS x (MINUSX 2 y) ) ) 

(OR (EQUALSX x (PLUSX ex (MINUSX y »y) > > 

(EQUALS ny y) 

(EQUALS y (MINUSX my 3))) 
(RECURSIVEX (LEFT (MINUSX 3 (MINUSX ex x) 1 

(MINUSX 3 (MINUSX my y) ) ) ) ) 
(ROUTINEX (RIGHT c. ng) TRUEX)) 

TRUEX Is a function of one argument uhich aluays returns 

ft value of T, Its purpose is to disengage the modus ponens 

filter. The statement (GEX ex x) says that the number of 
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cannibals to be sent in the boat cannot exceed the number 
available on that side of the rlviTl houever, by coding thi* 
statement as ITESTX (CEX ex xl) the human programmer has directly 

informed the program that (CEX c* x) is to be under the control 
of TE5TX as described in section 5. The statement (OR (EQUALS ■ 

miNUSX 1 yl) (EQUALS H IHINUSX 2 gtH Hue that the combined 
number of cannibals and Missionaries to be sent In the boat must 
b# either 1 or 2. The satisfaction of (AW (EQUALS ex x) [EQUALS 
my y>> In formula 2 means that the problem uould be solved since 
all remaining missionaries and cannibals uould be in the boat and 
on their uay to the right side of the river. The statement (OR 

(EQUALS x (PLUSX c- (HINUSX y my))) (EQUALS my y) (EQUALS y 

(MINUSX ny 3))) says that aither an equal number of cannibals and 
missionaries must be left oehind on shore after the boat departs 

(i.e. ex - x * ny - y) or the number of missionaries being left 
behind ()*e, ny - y) Is either or 3s this expresses the 
requirement tnat missionaries cannot be in the presence of a 
greater numoer of cannibals. The statement (RIGHT IHINUSX 3 

(HINUSX ex x)} miNUSX 3 01INU5X my y)l) says that the problen 
nou must be solved from a starting point uhich has the boat 

together ulth 3 - lex - x) cannibals and 3 - imy - y) 

mi 5ss lonar ies on the right side of the river; by coding this 

statement uith R£CLft5|V£X« the human program&er has direct ly 

informed the program that the statement is to be under the 

control of RECURSIVEX as described in section 5. 

The program initially pairs formulas 1 and 2 to rule R7 

for ex • ny * 3 and then, after applying the output to rule RIB, 
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It attempts to split the resulting formula. This split results 

in the generation of the three additional formulae INOT (RIGHT 1 
0»), (NOT (RIGHT 2 BJ], and INOT (RIGHT 1 1M uhich in turn 
become candidates for application to the deduction rules. The 
processing of (NOT (RIGHT 1 8)) generates (NOT (LEFT 3 3)1 uhich 

is rejected eince it is an instance of a previous formula. The 
processing of (NOT (RIGHT 2 0)) generates (NOT (LEFT 3 3)1 (which 
again is rejected) and also generates (NOT (LEFT 2 31). The 
program continues in this manner and obtains a proof in three 
minutes. In evaluating this performance, it Is uell to Keep in 
mind that the missionaries and cannibals problem lends itself 
more to a representation based upon GPS type operators (91 rather 

than upon the predicates of automatic theorem provers, 

7.2 Cryptarithmetlc 
This task 131, 13) involves the assignment of a decimal 
digit to each of the letters of three uords so that the sum of 

the flret tuo uords equals the third word. No digit may be ./ 

assigned to more than one letter and no uord may have zero 
assigned to Its flret letter. 

As a sample problem, consider OONALD + GERALD - ROBERT 

19). Although the program solved this problem Mi thou t hints. It 
uill be more illustrative to examine hou the program attacked 

this problem uhen provided u\ th the sane hint of » S that had 
been given elseuhere 191 to human subjects. This problem te 
represented by the fof lowing formula uh^rm Kl through K5 are the 

"carries* from one column to the next uhich the program must 
determine. 
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1. (GOAL* INDT (MO 

(QH (AND (EQUALS (PLUSX D Dl T) (EQUALS KS 8) ) 

(AND (EQUALS IPLUSX D D) (PLUSX 16 Til (EQUALS KB 11)1 
(OR (AND (EQUALS (PLUSX L L KS) Rl (EQUALS K4 0)1 

(ANO (EQUALS (PLUSX L L K5) (PLUSX 10 R) > (EQUALS Kfi 1)11 
(OR I ANO (EQUALS (PLUSX A A K4) El (EQUALS <3 0)1 

IANO (EQUALS (PLUSX A A KM (PLUSX 18 EH (EQUALS K3 1)11 
(OR (AtO (EQUALS (PLUSX N R K3) B) (EQUALS X2 0)1 

(ANO (EQUALS (PLUSX N R K3I (PLUX 10 Bl I (EQUALS <2 ID) 
(OR (AMD (EQUALS (PLUSX E K2I 0) (EQUALS Kl 0) I 

(ANO (EQUALS (PLUSX E K2I (PLUSX 10 01) (EQUALS Kl 1))) 
(EQUALS (PLUSX D G Kl) Rl 
(EQUALS D 51 
(GENERATORX A 9) 
(GENERAIORX B 9) 
IGENERATORX D 1 9) 
(GENERATORX E 9) 
(GENERATORX G 1 9) 
(GENERATORX L 9) 
(GENERATORX N 91 
(GENERATORX 91 
(GENERATORX R 1 91 
(GENERATORX T 91 
(TESTX (QISTINCTX A B E G L N R Tlllll 

The statement (DISTINCTX ABDEGLNDRT) In formula 

1 reflects the requirement that distinct digits must be assigned 

to distinct letters (i.e. DISTINCTX is a function of an 

indefinite number of arguments uhich returns a value of NIL or T 

depending upon uhether or not tuo of its arguments have identical 

values!. The ability to handle linear equations (by solving tor 

one variable in teras of the others) uas built into the predicate 

EQUALS- Also, the expressions of the for* iGEf*RAT0RX - y zl 

serve to generate alternative solutions to a case by assigning 

integer values to variable " ui th the values ranging fro* y to z* 
The LISP function GENERATORX uas used only to make the problem 
statement nore compact* The problem was solved at about the same 
speed uhen statements like (GENERATORX A 8 9) were replaced by 

(OR (EQUALS A 8) (EQUALS A 1) (EQUALS A3)). 

The program is asked to find values for the variables A, 
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B, 0, E. G, L, N, 0. R and I that will result in the attainment 
of the goal- It begins by applying formula 1 to rule R18 and 
then attempts to split tht resulting formula. After completing 
the relaxation stage (i.e. solving the different cases 
independently), it constructs a constraint eet for each variable 
and determines an order in which the different cases are to be 
reconciled. The following description of the program's attempt 
to reconcile the different solutions invites comparison with 
human protocols recorded in IS). Although sone amount of brute 
fores is intrinsic to the problee, the progran Is able to use Its 
knowledge about the nature of these solutions (e.g. they are 
distinct integers constrained by the results of the relaxation 
stage) to guide the search in an effective manner. 

First, it concludsi 0-5andR-D+G+Kl since these 
were the only solutions generated by their respective cases. 
Therefore, R - 5 + G + Kl* It then selects the case that had 
generated the two alternative solutions <1) T m 2*0, K5 - 8 and 
12) T m 2*0 W 10. K6 » 1. The first alternative solution T - 
2*D, K5 - 8 If rejected since It would iftply T m 2*5 - 10 which 
does not fall within the constraint set for T (i.e. one of the 

integers through 9). Therefore, K5 - 1 and T - 2*0 - 18 - 2*5 

* 

- 10 - 8. The next case selected had generated the two 
alternative solutions (II Kl - 0. E - -K2 and (2) Kl - l t E - IB 
* K2. It first assumes Kl - 8, E - -K2 and notices that E is 
expressed in terms of a single variable K2* It therefore looks 
at the constraint set of K2 which consists of only the values 8 
and i. However, it rejects the value K2 - 1 since that would 
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imply E - -1 which falls ou'side the constraint set tor E» It 

therefore concludes that <2 - 9. But this too is quickly 
rejected since it would imply E * 8 ■ T uhich violates the 
condition (activated by the LISP function 01STINCTX) that no two 

letters may have the sane decimal digits. Therefore. Kl * 1, E - 
18 - K2 and hence R-5 + G+K1 - G + G. Once again, i t looks 
at the constraint set for K2 and rejects the value K2 - 9 since 

It uould imply E - 10 - <2 - 10 uhich falls outside the 
constraint set for E. Therefore, K2 - 1 and hence E - 10 - K2 - 

a. •' '. '/ 

The next case selected had generated the two alternative 
solutions U> K3 - 8. E - 2*A + K4 and (2) K3 -1, E - 2*A + K4 * 
10. It first assumes K3 - t E ■ 2«A + K4 and hence 2»A + K4 - 3 
since E - 3* The theorem prover then proves 2*A + K4 ■ 9 by 
solving this linear equation for the variable A. It gets A ■ 4,5 

- ,5*K4 uhich suggests that it examine the constraint set of K4 
since A Is enpreescd in terms of the single variable K4. The 
value K4 - is rejected since A - 4.5 then uould fall outside 
the constraint set for A. Therefore. K4 - 1 and hence A - 4, It 
next selects the case that had yielded the tuo solutions (1) K4 - 
0. R - Z*L * K5 and (2) K4 - 1, R - 2*L + K5 -10 and rejects the 
first alternative since it contradicts K4 - 1. Therefore, R - 
2«L + K5 -10. Houever, since it already has deduced K$ - I and R 

- G + G, it concludes G + G - 2*L - 9 uhich uhen solved for L 
gives L - 7-5 + .5*G. It then selects the case that had yielded 
the tuo solutions (11 K2 - 0, B - N + R + *C3 and (2) K2 - 1, B - 
N + R + K3 - 10 and rejects the first alternative since it 
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contradicts K2 - X, Therefore. B - N * fl + K3 - 10. H0U*V«r, 
since It already bel ievet K3 - 3 and R - C + G, it gets & - U + C 
- 4- Newt, the program examines the case that had produced the 
integers 1 through 9 as possible values for fl. The value R • 9 
is rejected since this value already was assigned to E. The 
value R - S is quickly rejected sine* it results inC-R-6-2 
and hence L » 7,5 + ,5*G - S.5 which falls outside the constraint 
set of L. However, the choice R - 7 is accepted aa it results in 
C ■ 1, L - 8, and B-N + G-4-N-3. The program continues 
in this manner and eventually finds - 2, N • 6, and B » 3 which 
solves the problen. The total elapsed time uas *8 seconds. It 
took the progran five minutes to complete the more difficult 
problem of finding the solution without the hint of D - S. 

- 
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